PARC 3.0: A Corpus of Attribution Relations
نویسنده
چکیده
Quotation and opinion extraction, discourse and factuality have all partly addressed the annotation and identification of Attribution Relations. However, disjoint efforts have provided a partial and partly inaccurate picture of attribution and generated small or incomplete resources, thus limiting the applicability of machine learning approaches. This paper presents PARC 3.0, a large corpus fully annotated with attribution relations (ARs). The annotation scheme was tested with an inter-annotator agreement study showing satisfactory results for the identification of ARs and high agreement on the selection of the text spans corresponding to its constitutive elements: source, cue and content. The corpus, which comprises around 20k ARs, was used to investigate the range of structures that can express attribution. The results show a complex and varied relation of which the literature has addressed only a portion. PARC 3.0 is available for research use and can be used in a range of different studies to analyse attribution and validate assumptions as well as to develop supervised attribution extraction models.
منابع مشابه
The Penn Discourse TreeBank as a Resource for Natural Language Generation
While many advances have been made in Natural Language Generation (NLG), the scope of the field has been somewhat restricted because of the lack of annotated corpora from which properties of texts can be automatically acquired and applied towards the development of generation systems. In this paper, we describe how the Penn Discourse TreeBank (PDTB) can serve as a valuable large scale annotated...
متن کاملAssociation between mutations in gyrA and parC genes of Acinetobacter baumannii clinical isolates and ciprofloxacin resistance
Objective(s): We investigated the contribution of gyrA and parC mutational mechanism in decreased ciprofloxacin susceptibility of Acinetobacter baumannii isolated from burn wound infections. Materials and Methods: Ciprofloxacin susceptibility of 50 A. baumannii isolates was evaluated by disk diffusion and agar dilution methods. PCR and sequencing were performed for detection of mutation in gyr...
متن کاملAttribution: a computational approach
Our society is overwhelmed with an ever growing amount of information. Effective management of this information requires novel ways to filter and select the most relevant pieces of information. Some of this information can be associated with the source or sources expressing it. Sources and their relation to what they express affect information and whether we perceive it as relevant, biased or t...
متن کاملProduction of ultra cold neutrons by a doppler shifter with pulsed neutrons at J-PARC
Ultracold neutrons (UCNs) are neutrons whose kinetic energy is around a few hundred nanoelectronvolts. Neutrons with such small kinetic energy can be trapped in a material vessel or magnetic fields. Because of these unique characteristics, UCNs are used for some important experiments of fundamental physics. The Doppler shifter is a device to produce UCN by slowing them down by the reflection on...
متن کاملIs Violent Crime Increasing or Decreasing? a New Methodology to Measure Repeat Attacks Making Visible the Significance of Gender and Domestic Relations
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. IS VIOLENT CRIME INCREASING OR DECREASING? A NEW METHODOLOGY TO MEASURE REPEAT ATTACKS MAKING VISIBLE THE SIGNIFICANCE OF...
متن کامل